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At'  ,!  met. 


The  family  if  Infinitely  divisible  distributions  i . ’•  nm  to 

provide  alternative  formulations  in  several  inf  "rent  ia  situatd  ons. 
In  particular,  tin-  f nily  provides  heavy-tailed  distributions  and 
distributions  for  use  in  models  involving  ronvolutlons,  such  as 
signal-plus-noise  models.  Charae teriv.-t*  ion.-  of  sub-families  of  the 
infinitely  divisible  family  are  used  to  obtain  statistical  tests  of 
membership  in  those  sub-families.  Special  attention  is  given  to  th'- 
normal  and  normul-plus-Poisson  sub-families. 


[ 


1.  Introduction.  A random  variable  (r.v.)  is  said  to  be  infinitely  divisible 

(inf.  div.)  if  for  every  n = 2,3,...  thero  exin4-  independent  identically  distributed 

r.v.'s  X ,,  X ...  such  that  the  distribution  of  X ,+  X _+...+  X is  the  same 
nl  n2  (j  nl  rj?  nn 

as  that  of  X.  Thus,  letting  U * V mean  that  U and  V have  the  same  distribution, 
such  an  X can  be  expressed  in  terms  of  a triangular  array. 

X ~ X21  + X22 

X31  + X32  + X33 


S X 


hi 


+ X 


nbJ 


+ X 


nn 


fri  terms  of  the  characteristic  function  (c.f.)  f(u)  of  the  r.v.  X,  this  is  equivalent 
to  saying  that  for  each  n = 2,3,...  there  exists  a c.f.  f (u)  such  that 

f(u)  = [fn(u)]n. 


'■Research  supported  by  AFOSR  Grant  #76-3050. 
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Key  h'vrde  and  phi-.uje.i . Infinite  divisibility,  normality,  Poisson  distribution, 
cumulan is , convolution.; . 


1 


It  will  t-  f.hovm  tr  it  the  family  of  inf.  div.  distributions  is  a family  which  can 
b u-efu!  in  a iMubu'  of  mvi.-1  s,  er. pc  • tally  those  requiring  heavy-tailed  distributions 
and  those  in  which  the  oh.  • : variable  is  a convolution,  a sum  of  two  independent 

r . v . • s . 

The  family  oi'  inf.  div.  distributions  is  quite  broad.  It  includes  the  normal 
distributions,  as  w- •! 1 as  the  gamma  distributions,  the  related  exponential  and  chi-square 
distributions,  and  the  double-exponential  distributions.  It  includes  the  Poisson  and 
compound  Poisson  distributions  (distributions  generated  by  putting  a distribution  on 
the  Pol:  son  parameter),  though  here  we  shall  be  interested  primarily  in  continuous 
inf.  div.  distributions.  The  family  also  includes  the  generalized  Poisson  distributions 
(distributions  of  r.v.'r  which  are  sums  of  a Poisson-dis tributed  number  of  identically 
distributed  r.v.'s).  Some  distribution,  which  are  not  inf.  div.  are  those  with 
bound*  1 support,  those  whose  c.f.  vanishes  at  some  point  on  the  real  line,  and  those 
whose  c.f.  is  an  entire  function  which  vanishes  at  some  point  in  the  complex  plane. 

Another  way  to  demonstrate  the  breadth  of  the  inf.  div.  family  is  to  note  that 
each  of  the  following  families  of  distributions  contains  the  preceding:  normal  distri- 

butions, stable  distributions,  seif-decomposable  distributions,  inf.  div.  distributions. 

Relatively  recently  a number  of  researchers  have  used  stable  distributions  for 
modelling  various  phenomena.  Since  every  stable  distribution  is  inf.  div.,  the  inf.  div. 
distributions  can  be  used  wherever  stable  distributions  are,  and  the  result  is  a model 
which  is  less  restrictive  and  can  be  valid  under  more  general  circumstances. 

?.  Heavy-tailed  distributions.  A number  of  researchers  have  studied  stable 
distributions  [see.  e.g.,  DuMouchel  (1973,  1975).  Fame  and  Roll  (1971)]  because  they 
are  "heavy-tailed."  A primary  motivation  for  such  studies  is  the  observation  of 
economists  that  the  distributions  of  changes  in  stock  prices  seem  to  be  rather  heavy- 
tailed. 

Stable  distributions  are  indeed  heavy-tailed.  In  fact,  the  only  stable  distribution 
with  finite  variance  is  the  normal  distribution.  It  is  acknowledged  [see,  e.g., 

DuMouchel  (1973),  p.  ^9]  that  it  is  not  necessary  t.o  use  infinite-variance  distributions 
in  order  to  provide  heavy-tailed  distributions. 

If  fact.,  all  infinitely  divisible  distributions  are  heavy-tailed.  For,  as  will 

bo  shown  below,  their  fourth  cumulant , , is  necessarily  non-negative.  Thus,  letting 

u denote  the  r-th  central  moment , we  have 
r 
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t 

= /(x  ~ p)‘W(x)  - y?  Sir.  - p)'dF(x) 

= 5 l ( x— u ) - 3o^(x-p)'  ]'1F(x), 

p 

vher  F(x  d<  >tes  tl  1L  ‘ibution  function  and  o - p Is  the  vari an  :e.  Thus 

dc-'(x)  weight:;  relatively  hea.-.ly  those  points  x For  which 

( x - p)1’  - 3->?(x  - p )?  > 0; 

i.e. , 

| x - p | ^ . 

Trias  F(x)  corresponds  to  n relatively  heavy-tailed  distribution.  More  precisely, 
a normal  distribution  has  >.  ~ 0,  so  a distribution  with  positive  fourth  cumulant 
is  heavy -t  ail  ed  relative  to  th?  normal. 

It.  remains  to  si  >w  that  th'  fourth  cunulant  is  non-negative. 

THEOREM  2.1.  If  X is  inf.  div.  with  finite  fourth  moment,  then  the  fourth 
cwul  rr  b is  non-negat i:>e. 

PROOF. [Cf.  Pi.erre(l969) , p.  320.]  Since  X has  finite  fourth  moment,  the  fourth 
cunulant  exists  and  is  given  by 


(2.1) 


<j4  = (d  /du’)log  f(u)|u_Q, 


where  f(u)  is  the  c.f.  of  X.  Since  X is  inf.  div.,  the  logarithm  of  f(u) 
can  be  written  [see.  e.g. , Loeve(l9o3),  p.  293]  as 


(2.2) 


lof  f(u)  = iup  + /[exp(iux)  - 1 - iuxlx  ’dK(x), 


where  K(x)  is  monotone  increasing  and  of  bounded  variation,  K(-»)  = 0,  K(“) 

- Var(X)  < °°,  arid  p = E(X).  The  integrand  is  defined  at  the  origin  by  continuity. 
Now,  from  Lo^ve(l963),  p.  293,  one  sees  that 


(2.3) 


- (d"/du^)log  f(u)  = Texp(iux)dF(x)  . 


Tr,  ■ the  Left-hand  side  of  (2.3)  is  a c.f.  Since  this  c.f.  is  twice  differentiable, 
its  second  derivative  in  given  by  [Lo6ve( 19^3) , p.  200] 


Thus,  by  (2.1), 

(2.U) 


(d  /dulj)log  f(u)  = Ir.‘  exp( iux)dK(x)  . 


Kj  = fx  dK(x)  >_  0 


v du* 


(1.1- 


...  Lder  the  i h 
X - Y h Z, 


v'  t Y ■!  1 7.  ■ • leperrlent,  non-1  dent i cally  di atr  Lbs  ru’d  an!  individu- 
ally not.  • r.v.  has  a distribution  in  a parametric  family  {P  } 

and  Z h i •;  ’■  i . ..  ther  parametric  family  {Q  1 . Thus  the  family  c C 

Ol) 

disti  ibul ions  for  th  >bs  r\  . r.v.  X is  a family  of  convolution  distributions 
{ * '• , wh  e * solution  operation  of  distribution  functions. 

'!  , V to 

Vhi  s ij  c l .-•'••»{]  by  £ • uve  and  Van  Ryzir.  ( '!  97P ) • They  show  that,  when  Z is 

:i  screte,  un-  •'  1a  ion  tl  ? par  an  iters  8 and  becomes  intract- 
able ad  1 juj  i illj  od  o mom  nt  » offers  solution. 

Any  .;i  il-plu.  .is  ; ••  ial  i u of  tie-  form  (3.1).  Th*“  model  vith  discrete  signal 
7.  c . o ■ c ir  in  any  conn  tin;;  pr  • -s:;  whore  t.'-e  count  is  recorded  as  a measured  electri- 
cal  vl  i r isull  . fro:  tl  .•  actual  couni  plus  an  error  introduced  by  electrical 

n<  i i in  the  c >unl  ing  m >eh  riism.  Another  application  arises  in  the  problem  of  estimating 
tl  mean  1 • . i >f  rirui  r bacteria)  in  a homogeneous  solution  where  the  "count." 

X is  measured  ns  the  nr  / < n a slide  occupied  by  the  viruses  where  each  virus  occupies 
a unit  of  area,  lie-  re,  the  trial  area  is  Y + Z where  Z is  the  number  of  viruses 
and  Y is  the  sum  the  deviations  from  the  ideal  (on’  virus  per  unit  area)  plus 
error  in  the  r-  a..:  sent  of  • roa.  In  many  such  applications  it  is  reasonable  to  take 

i-he  distribution  of  the  coni  inu.-us  variable  Y to  be  normal  and  that  of  the  discrete 
variable  Z to  be  Poissoi  r >isson— related  (e.g. , negative  binomial  or  some  other 
compound  Poisson  distribution). 


Now,  if  X = Y + Z,  where  Y is  normal  and  Z is  Poisson  or  compound  Poisson, 

then  X is  inf.  div.  In  fact,,  the  r.v.  X is  inf.  div.  if  and  only  if  (2.2)  holds. 

Tne  integrand  in  (?.2)  is  defined  by  continuity  at  the  origin.  Hence,  since  the  limit 

—2  2 

as  x tends  to  zero  of  the  integrand  [exp(iux)  - 1 - iuxlx  is  -u  , we  have 
(3.2)  log  f(u)  = iup  - + /[exp(iux  - 1 - iux]x  ^dM(x)  , 


wher-*  (S  is  the  jump  of  K(x)  at  the  origin  and  M(x)  lias  no  mass  at  the  origin. 

d 2 2 

Tins  is  equivaler  ' to  X = u + Y + Z,  where  Y has  log  c.f.  equal  to  -u  6 and 

lienee  is  norma  L (with  Var(Y)  = 2<S  ] , and  Z has  ]of  c.f.  equal  to  the  integral  in 

(3-.  ).  The  r.v.  Z is  call ‘ i the  "Poisson  component"  of  X or  is  said  to  be  of 

"Poisson  type."  Thus  the  convolution  model  (3.1)  with  Z suitably  distributed  leads 

to  an  inf.  div.  X.  Conversely,  every  inf.  div.  X obeys  a convolution  model. 
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It  was  noticed  by  Borges  ( 1S66)  and  later  independently  by  Pierre  (1967) 
that  nullity  of  the  fourth  cumulant  characterizes  the  normal  distribution  in 
the  class  of  inf.  div.  laws. 

THEOREM  U.l.  An  inf.  div.  distribution  is  normal  if  and  only  if  it*  fourth 
cunulant  is  zero. 

PROOF.  If  X is  normal,  then  its  c.f.  f(u)  satisfies 

p 2 

log  f(u)  = iuu  - u‘6  , 

30  that  in  the  expansion  of  log  f(u)  all  terms  of  degree  greater  than  two 
vanish.  The  r-th  cunulant  is  the  coefficient  of  i u /r!  in  the  expansion. 

Hence  all  eunulants  of  order  greater  than  two  vanish.  Conversely,  suppose  X 
is  inf.  div.  and  has  zero  fourth  cumulant.  By  (2.2)  and  (3.2),  X is  normal  if 
and  only  if  K(x)  increases  only  at  x = 0.  Hence  it  suffices  to  prove  that 
K(x)  increases  only  at  x * 0.  But  = 0,  so  this  is  immediate  from  (2.U). 

Using  this  characterization  of  the  normal  distribution  among  inf.  div.  laws, 
one  can  construct  a test  of  the  hypothesis  that  an  inf.  div.  r.v.  is  normally 
distributed.  The  hypothesis  to  be  tested  is 

H:  X is  normally  distributed,  given  that  X is  inf.  div. 

The  alternative  hypothesis  is  not  H:  X is  inf.  div.  but  is  not  normally  dis- 
tributed. The  hypothesis  is  equivalent  to  <^  = 0. 

An(unbiased)  estimator  for  is  [Kendall  and  Stuart,  p.  28l,  (12.29)1 

k^  * n2[(n+l)m^  - 3(n-l)m22]/[(n-l)(n-2)(n-3) ], 

where 

mr  = Ei=l(xi  ' x)r/n 

is  the  sample  analogue  of  u (and  is  a biased  estimator  of  u ) . The  statistic 

r r 

k./o(k,),  where  o(S)  denotes  the  standard  deviation  cf  the  statistic  S, 
i3  asymptotically  normally  distributed  under  H;  so  is  k^/s(k^),  if  s(k^) 
is  a consistent  estimator  for  o^).  The  variance  of  k^  is  relatively  compli- 
cated [Kendall  and  Stuart  (1969),  p.  290,  (12.37)1,  bat  under  the  hypothesis  of 
normality  it  reduces  simply  to 
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o2(k^)  = U8o^/(n-l}^ 

[Kendall  and  Stuart  (1969),  p.  296,  (12.71)];  a consistent  estimator  for  this  is 

p _ ^ p p 

obtained  by  substituting  s = I.  (x.-x)  for  a . (Alternatively,  one  could 
8 * ^ 8 

replace  s by  an  unbiased  estimator  for  0 .)  At  level  a one  rejects  the 
hypothesis  of  normality  if 

i kj^/s ( ) | > z(a/2), 

where  z(p)  denotes  the  upper  p-th  percentage  pcint  of  the  standard  normal 
distribution. 

5.  Testing  for  normal-olus-Poisson.  Of  special  interest  in  the  convolution 
model  is  the  case  in  which  Y is  normal  and  2 is  Poisson.  Sclove  and  Van  Ryzin 
(1971)  give  parameter  estimates  for  such  special  cases.  Before  applying  their 
results,  it  would  be  desirable  to  test  the  adequacy  of  such  a special  model. 

The  adequacy  of  the  normal-plus-Poisson  assumption  can  be  tested  against  the  under- 
lying assumption  that  X is  inf.  d*'''.  For  this  we  need  the  following  theorem 
[Pierre  (1971),  p.  3^8 ] . 

THEOREM  5.1.  Suppose  X ts  inf.div.  Then  X = Y+Z,  where  Y is  normal  and 
Z is  ordinary  Poisson , if  and  only  if 

(5.1)  Kg  - 2<5  + = 0. 

PROOF.  The  c.f.  of  a Poisson  r.v.  with  parameter  y has  logarithm  equal  to 

(5.2)  y [exp(iu)  - l]  . 


The  terms 


iuy  + /[exp(iux)  - 1 - iux]x  dM(x) 


of  (3.2)  are  of  the  form  (5-2)  if  and  only  if  M(x)  increases  only  at  x = 1. 

Now  suppose  (5-1)  holds.  Then 

0 * k6  “ 2k5  + 

* (d^/du^)  log  f(u)  |u=0  -2(d5/du5)  log  f(u)  |u_Q  + (d^/du^)  log  f (u)  |^_0 

* /x1*dM(x)  - 2/x^dM(x)  + 

* /x2(x2  - 2x  + l)dM(x) 

* /x2(x-l)2dM(x)  . 


/x2dM(x) 


i 


7 

Thus  M(x)  increases  only  at  x = 1 [an  increase  at  x = 0 having  already  been 
excluded  in  replacing  K(x)  by  M(x)],  and  so  Z is  ordinary  Poisson.  Conversely, 
if  X = Y ♦ Z,  where  Y is  normal  and  Z is  Poisson  with  parameter  u,  then 
the  log  c.f.  is  given  by 

log  f(u)  = -u252  + u[exp(iu)  - l] 

= iuu  + i2u2(262+u  )/2  + i^u^u/3!  + i**u**u/U!  + ... 

so  that 

2 

(5.3)  •c1  = U,  <2  3 26  +u  , = U * 3 ...  • 

In  particular, 

Xg  - 2x^  + x^  = u - 2u  + u = 0. 

An  estimate  of  6 = Xg  - 2x^  + x^  is  b = kg  - 2k,.  ♦ k^,  where  these 

k-statistics  are  given  in  Kendall  and  Stuart  (1969),  page  280,  (12.28).  Let 

2 2 
s (b)  be  a consistent  estimator  for  c ;b).  At  level  a one  rejects  the 

normal-plus-Poisson  hypothesis  if  jb/s(b) j > z(a/2).  The  quantities  Var( k^) , 

Var(k^),  Var(kg),  Cov(k^,k^),  ar.d  Cov(k^,x^)  needed  to  compute  Var(b)  are 

given  in  Kendall  and  Stuart(l969) , rages  290-29^.  [Unfortunately,  Cov(k..,k-) 

0 o 

is  not  given.]  These  formulas  are  complicated  but  could  be  simplified,  using 

(5.3) ,  to  provide  an  expression  for  a (b)  under  the  normal -plus-Poissor.  hypo- 
thesis. This  expression  will  involve  only  E(Y),  Var(Y),  and  E(Z),  which  could' be 
estimated  unbiasedly  by  formulas  provided  by  Sclove  and  Van  Ryzin  (1971).  These 
estimates  could  then  be  substituted  into  the  expression  for  o(b)  to  provide 

the  required  consistent  estimate  s(b). 

An  alternative  approach  is  subsampling.  One  partitions  the  sample  into 
several  (say,  t)  disjoint  subsamples  and  computes  an  estimate  b from  each. 

Let  bj,  j * l,2,...,t,  denote  the  subsample  values.  Define 

* ■ 

and  take 

s2(b)  = ZJ^1(bJ  - b)2/[t(t-l)]  . 

Then  the  test  statistic  is  b/s(b)  . One  needs  to  take  t large  enough  so  that 
approximate  normality  of  b can  be  used. 
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6.  Testing  for  normality,  given  normal-pluc-Pcioscn.  The  presence  of  the 
Poisson  component  Z could  affect  adversely  the  power  of  the  test  of  normality 
of  Section  4.  Accordingly,  it  cakes  sense  to  consider  testing  nested  hypotheses 

in  sequence.  One  first  tests  the  hypothesis  of  Section  5,  viz.,  <g  - 2<c  + = 0. 

If  this  hypothesis  is  rejected,  one  steps  and  retains  the  full  model. (The  nature 
of  the  component  Z is  not  then  further  specified.)  On  the  other  hand,  if  this 
hypothesis  is  accepted,  one  then  tests  the  hypothesis 

•c^  * 0,  given  that  Kg  - 2k,.  + 3 C. 

This  is  logically  equivalent  to  the  hypothesis 

*6  ” 2|C5  = °* 

The  statistic  c 3 kg  - 2k^  is  an  unbiased  estimator  for  Kg  - 2<^  . A test 
statistic  is  c/s(c),  where  s(c)  is  a consistent  estimator  for  o(c).  Either 
of  the  approaches  of  Section  5 could  be  used.  The  hypothesis  of  normality  would 
greatly  simplify  the  expression  for  a(c). 

7.  Testing  for  a Poisson  distribution.  Though  we  have  focused  on  continuous 
r.v.'s  X,  it  is  of  interest  to  note  how  one  can  test  the  hypothesis  that  X is 
Poisson  (i.e.,  Y is  zero  and  Z is  ordinary  Poisson).  The  r.v.  X is  Poisson 
if  and  only  if  K(x)  can  have  a jumpt  only  at  x = 1.  This  corresponds  to 

0 3 /(x  - l)2dX(x) 

3 /x2 dK(x)  - 2/xdK(x)  + /dK(x) 

3 (d**/du  ) log  f( u) !u_q  - 2(d3/du3)  log  f(u) |u=0  + (d2/du2)  log  f(u) |u_Q 

3 K.  - 2k  + K_ 

432 

* a , 

say.  Let  d = - 2k^  + k^  he  the  k-statistic . estimate  of  3.  Then  a test 

statistic  for  the  hypothesis  of  a Poisson  distribution  is  d/s(u),  where  s(d) 
is  a consistent  estimator  for  o(d).  Again,  an  alternative  approach  is  provided 
by  subsampling. 


tL__ 
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ABSTRACT  (Continue  on  reverae  aide  If  neceaamry  end  identify  by  block  number) 

^The  family  of  infinitely  divisible  distributions  is  shown  to  provide  alterna- 
tive formulations  in  several  inferential  situations.  In  particular,  the  family 
provides  heavy-tailed  distributions  and  distributions  for  use  in  models  invol- 
ving convolutions,  such  as  signal -plus-noise  models.  Characterizations  of  sub- 
families of  the  infinitely  divisible  family  are  used  to  obtain  statistical  ' 
tests  of  membership  in  those  sub-families.  Special  attention  is  given  to  the 
normal  and  normal-plus-Poisson  sub-families.^ 
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